Overview
Brought to you by YData
Dataset statistics
| Number of variables | 6 |
|---|---|
| Number of observations | 28327324 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.5 GiB |
| Average record size in memory | 95.6 B |
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 4 |
id_categoria has constant value "1" | Constant |
liq_um is highly skewed (γ1 = 78.47871768) | Skewed |
liq_um has 433340 (1.5%) zeros | Zeros |
Reproduction
| Analysis started | 2025-10-18 17:27:41.945858 |
|---|---|
| Analysis finished | 2025-10-18 17:30:56.528697 |
| Duration | 3 minutes and 14.58 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
id_categoria
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.3 GiB |
| 1 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 28327324 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 28327324 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 28327324 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 28327324 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 28327324 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 28327324 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 28327324 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 28327324 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 28327324 |
id_cliente
Real number (ℝ)
| Distinct | 97065 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 372401.02 |
| Minimum | 13 |
|---|---|
| Maximum | 727517 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 216.1 MiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 60522 |
| Q1 | 190023 |
| median | 374107 |
| Q3 | 556663 |
| 95-th percentile | 689253 |
| Maximum | 727517 |
| Range | 727504 |
| Interquartile range (IQR) | 366640 |
Descriptive statistics
| Standard deviation | 203824.56 |
|---|---|
| Coefficient of variation (CV) | 0.54732546 |
| Kurtosis | -1.2000749 |
| Mean | 372401.02 |
| Median Absolute Deviation (MAD) | 182834 |
| Skewness | -0.072202653 |
| Sum | 1.0549124 × 1013 |
| Variance | 4.1544451 × 1010 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 310323 | 5070 | < 0.1% |
| 563742 | 4794 | < 0.1% |
| 317419 | 4738 | < 0.1% |
| 386909 | 4616 | < 0.1% |
| 703385 | 4563 | < 0.1% |
| 397590 | 4541 | < 0.1% |
| 119666 | 4512 | < 0.1% |
| 435032 | 4479 | < 0.1% |
| 245753 | 4426 | < 0.1% |
| 229624 | 4414 | < 0.1% |
| Other values (97055) | 28281171 |
| Value | Count | Frequency (%) |
| 13 | 15 | < 0.1% |
| 33 | 290 | < 0.1% |
| 51 | 1549 | |
| 56 | 452 | < 0.1% |
| 62 | 126 | < 0.1% |
| 64 | 3 | < 0.1% |
| 86 | 278 | < 0.1% |
| 122 | 454 | < 0.1% |
| 148 | 65 | < 0.1% |
| 149 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 727517 | 1 | < 0.1% |
| 727516 | 1 | < 0.1% |
| 727515 | 1 | < 0.1% |
| 727513 | 179 | |
| 727512 | 352 | |
| 727510 | 1 | < 0.1% |
| 727507 | 159 | |
| 727505 | 158 | |
| 727494 | 9 | < 0.1% |
| 727493 | 17 | < 0.1% |
id_periodo
Real number (ℝ)
| Distinct | 84 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 202190.02 |
| Minimum | 201809 |
|---|---|
| Maximum | 202508 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 216.1 MiB |
Quantile statistics
| Minimum | 201809 |
|---|---|
| 5-th percentile | 201902 |
| Q1 | 202010 |
| median | 202206 |
| Q3 | 202401 |
| 95-th percentile | 202504 |
| Maximum | 202508 |
| Range | 699 |
| Interquartile range (IQR) | 391 |
Descriptive statistics
| Standard deviation | 199.64598 |
|---|---|
| Coefficient of variation (CV) | 0.00098741759 |
| Kurtosis | -1.0544741 |
| Mean | 202190.02 |
| Median Absolute Deviation (MAD) | 195 |
| Skewness | -0.11911251 |
| Sum | 5.7275023 × 1012 |
| Variance | 39858.519 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 202212 | 431300 | 1.5% |
| 202312 | 410059 | 1.4% |
| 202412 | 407005 | 1.4% |
| 202303 | 404418 | 1.4% |
| 202112 | 394008 | 1.4% |
| 202211 | 391564 | 1.4% |
| 202109 | 389822 | 1.4% |
| 202502 | 388200 | 1.4% |
| 202401 | 387052 | 1.4% |
| 202203 | 386607 | 1.4% |
| Other values (74) | 24337289 |
| Value | Count | Frequency (%) |
| 201809 | 260451 | |
| 201810 | 252172 | |
| 201811 | 283500 | |
| 201812 | 314043 | |
| 201901 | 298732 | |
| 201902 | 298273 | |
| 201903 | 299924 | |
| 201904 | 280120 | |
| 201905 | 278713 | |
| 201906 | 252741 |
| Value | Count | Frequency (%) |
| 202508 | 325489 | |
| 202507 | 333226 | |
| 202506 | 308601 | |
| 202505 | 337662 | |
| 202504 | 341065 | |
| 202503 | 385217 | |
| 202502 | 388200 | |
| 202501 | 385482 | |
| 202412 | 407005 | |
| 202411 | 364850 |
tipo_mix
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 GiB |
| PREMIUM | |
|---|---|
| MASIVO |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.5508977 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PREMIUM |
|---|---|
| 2nd row | PREMIUM |
| 3rd row | PREMIUM |
| 4th row | PREMIUM |
| 5th row | PREMIUM |
Common Values
| Value | Count | Frequency (%) |
| PREMIUM | 15605459 | |
| MASIVO | 12721865 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| premium | 15605459 | |
| masivo | 12721865 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 43932783 | |
| I | 28327324 | |
| R | 15605459 | 8.4% |
| P | 15605459 | 8.4% |
| E | 15605459 | 8.4% |
| U | 15605459 | 8.4% |
| A | 12721865 | 6.9% |
| S | 12721865 | 6.9% |
| V | 12721865 | 6.9% |
| O | 12721865 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 185569403 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 43932783 | |
| I | 28327324 | |
| R | 15605459 | 8.4% |
| P | 15605459 | 8.4% |
| E | 15605459 | 8.4% |
| U | 15605459 | 8.4% |
| A | 12721865 | 6.9% |
| S | 12721865 | 6.9% |
| V | 12721865 | 6.9% |
| O | 12721865 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 185569403 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 43932783 | |
| I | 28327324 | |
| R | 15605459 | 8.4% |
| P | 15605459 | 8.4% |
| E | 15605459 | 8.4% |
| U | 15605459 | 8.4% |
| A | 12721865 | 6.9% |
| S | 12721865 | 6.9% |
| V | 12721865 | 6.9% |
| O | 12721865 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 185569403 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 43932783 | |
| I | 28327324 | |
| R | 15605459 | 8.4% |
| P | 15605459 | 8.4% |
| E | 15605459 | 8.4% |
| U | 15605459 | 8.4% |
| A | 12721865 | 6.9% |
| S | 12721865 | 6.9% |
| V | 12721865 | 6.9% |
| O | 12721865 | 6.9% |
id_sku_venta
Real number (ℝ)
| Distinct | 520 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 428579.43 |
| Minimum | 515 |
|---|---|
| Maximum | 604926 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 216.1 MiB |
Quantile statistics
| Minimum | 515 |
|---|---|
| 5-th percentile | 7493 |
| Q1 | 450325 |
| median | 450604 |
| Q3 | 450749 |
| 95-th percentile | 451269 |
| Maximum | 604926 |
| Range | 604411 |
| Interquartile range (IQR) | 424 |
Descriptive statistics
| Standard deviation | 104985.77 |
|---|---|
| Coefficient of variation (CV) | 0.24496222 |
| Kurtosis | 12.112201 |
| Mean | 428579.43 |
| Median Absolute Deviation (MAD) | 201 |
| Skewness | -3.5935882 |
| Sum | 1.2140508 × 1013 |
| Variance | 1.1022012 × 1010 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 450607 | 1159881 | 4.1% |
| 450604 | 981474 | 3.5% |
| 450592 | 918378 | 3.2% |
| 450684 | 914142 | 3.2% |
| 450237 | 907067 | 3.2% |
| 450133 | 852698 | 3.0% |
| 450403 | 749381 | 2.6% |
| 592 | 741858 | 2.6% |
| 450746 | 728508 | 2.6% |
| 450417 | 721352 | 2.5% |
| Other values (510) | 19652585 |
| Value | Count | Frequency (%) |
| 515 | 65937 | 0.2% |
| 566 | 369 | < 0.1% |
| 592 | 741858 | |
| 595 | 130305 | 0.5% |
| 622 | 116009 | 0.4% |
| 763 | 23991 | 0.1% |
| 765 | 80408 | 0.3% |
| 7481 | 14234 | 0.1% |
| 7482 | 12103 | < 0.1% |
| 7483 | 59447 | 0.2% |
| Value | Count | Frequency (%) |
| 604926 | 3169 | < 0.1% |
| 604583 | 2105 | < 0.1% |
| 604582 | 34283 | 0.1% |
| 604581 | 390476 | |
| 604580 | 42379 | 0.1% |
| 604533 | 45 | < 0.1% |
| 604514 | 494 | < 0.1% |
| 604509 | 32340 | 0.1% |
| 604508 | 5513 | < 0.1% |
| 451500 | 7287 | < 0.1% |
liq_um
Real number (ℝ)
Skewed Zeros
| Distinct | 16318 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.93081289 |
| Minimum | -89.46 |
|---|---|
| Maximum | 3096.6533 |
| Zeros | 433340 |
| Zeros (%) | 1.5% |
| Negative | 4395 |
| Negative (%) | < 0.1% |
| Memory size | 216.1 MiB |
Quantile statistics
| Minimum | -89.46 |
|---|---|
| 5-th percentile | 0.0462 |
| Q1 | 0.084 |
| median | 0.21 |
| Q3 | 0.588 |
| 95-th percentile | 2.94 |
| Maximum | 3096.6533 |
| Range | 3186.1133 |
| Interquartile range (IQR) | 0.504 |
Descriptive statistics
| Standard deviation | 7.3722788 |
|---|---|
| Coefficient of variation (CV) | 7.9202586 |
| Kurtosis | 12213.228 |
| Mean | 0.93081289 |
| Median Absolute Deviation (MAD) | 0.1512 |
| Skewness | 78.478718 |
| Sum | 26367438 |
| Variance | 54.350495 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.07896 | 1776867 | 6.3% |
| 0.05544 | 1702889 | 6.0% |
| 0.084 | 1338104 | 4.7% |
| 0.15792 | 1214046 | 4.3% |
| 0.0588 | 912625 | 3.2% |
| 0.168 | 902608 | 3.2% |
| 0.11088 | 780586 | 2.8% |
| 0.05964 | 754101 | 2.7% |
| 0.23688 | 752916 | 2.7% |
| 0.042 | 651926 | 2.3% |
| Other values (16308) | 17540656 |
| Value | Count | Frequency (%) |
| -89.46 | 1 | |
| -63.168 | 1 | |
| -56.8512 | 1 | |
| -52.98216 | 1 | |
| -47.7708 | 1 | |
| -38.34852 | 1 | |
| -35.784 | 1 | |
| -33.3984 | 1 | |
| -33.264 | 1 | |
| -31.584 | 1 |
| Value | Count | Frequency (%) |
| 3096.65328 | 1 | |
| 2949.9456 | 1 | |
| 2527.66752 | 1 | |
| 2349.8496 | 1 | |
| 2336.97912 | 1 | |
| 2128.7616 | 1 | |
| 2021.376 | 1 | |
| 1958.208 | 1 | |
| 1953.4704 | 1 | |
| 1895.04 | 1 |
Interactions
Correlations
| id_cliente | id_periodo | id_sku_venta | liq_um | tipo_mix | |
|---|---|---|---|---|---|
| id_cliente | 1.000 | -0.007 | -0.017 | -0.003 | 0.054 |
| id_periodo | -0.007 | 1.000 | 0.293 | -0.080 | 0.171 |
| id_sku_venta | -0.017 | 0.293 | 1.000 | -0.115 | 0.199 |
| liq_um | -0.003 | -0.080 | -0.115 | 1.000 | 0.010 |
| tipo_mix | 0.054 | 0.171 | 0.199 | 0.010 | 1.000 |
Missing values
A simple visualization of nullity by column.